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SYSTEM AND METHOD FOR POPULATING 
FORMS WITH PREVIOUSLY USED DATA 
VALUES 

TECHNICAL HELD 

This invention relates generally to computer- implemented 
processing of data entry forms, such as HTML-generated 
forms on Internet web pages. More particularly, the inven- 
tion provides a method and apparatus for automatically 
populating data fields in forms using data values previously 
specified by a user 

BACKGROUND OF THE INVENTION 

Computer systems conventionally display forms with 
fields into which a tiser enters information such as a name, 
birth date, password, and the like. Modern browsers display 
forms by rendering Hyper Text Markup Language (HTML) 
to generate fields arranged in a particular structure that can 
be populated by a user. Web sites that accept shopping orders 
from on-Une customers, for example, generate forms requir- 
ing entry of the customer's name, address, telephone 
number, and credit card information. Usually, the user must 
repeatedly enter this information each time a site is visited. 
Although information entered by the user is stored on the 
web site, the form does not retain the information for future 
use if the web site is revisited. 

Some web sites can recognize previous customers and 
thus avoid re-prompting for the same information on a 
subsequent visit. Nonetheless, if the user visits a new web 
site that he or she has never before visited, the same 
information must be re-entered on a different form generated 
by the different web site. Much of the information requested 
on these forms is redundant or readily available from other 
sources (e.g., name and address), yet the creators of different 
forms generally have no easy way to share information 
previously entered by the user on an earlier form. Privacy 
issues have thwarted many potential solutions to this 
problem, and it is cumbersome for web site designers to 
include special logic on their web site to recognize previous 
visitors to the site. 

So-called "cookies" (small data files stored by a web site 
on the user's local computer) are sometimes used to retain 
information locally that can be recalled later by a web site 
that the user has previously visited. Such "cookies," 
however, vary widely from site to site, and require cumber- 
some programming logic on each web site to implement 
them. Moreover, users can block the storage of these 
cookies, and users may be generally suspicious of their use 
by untrusted web sites. 

One attempt to solve some of these problems was a prior 
art feature included in the Microsoft Internet Explorer 4.0 
product known as a "profile assistant." This feature made it 
easier for web sites to retrieve registration and demographic 
information from users who had previously provided that 
information. Frequently used information such as user name, 
address, and the like was stored securely in protected storage 
on the client computer. Web servers could request to read 
this information, but it was shared only if users gave their 
consent in a pop-up request box each time a site was visited. 

While the profile assistant provided a potential solution to 
the aforementioned problems, in practice it required that 
each web site write script to request information from the 
user's stored information. If the user declined to grant 
permission to share the information, the solution was effec- 
tively thwarted. It was also inconvenient and time consum- 
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ing for the user to complete a full profile and store it on the 
user's machine. Finally, some users viewed the function as 
intrusive because it required immediate \iser input to confirm 
that the feature ^ould be enabled each time a web site was 
S visited. 

A prior art data sdiema known as the "vCard" schema has 
been used for certain frequently referenced data fields across 
application programs. This schema established standardized 
field identifiers that were to be used for the same data fields, 

10 and was intended to facilitate the transfer of personal profile 
information among applications. For example, the field 
identifier "vCard.FirstName'' was reserved as a field iden- 
tifier for storing a user's first name, regardless of the form 
or application program into which the user's name was to be 

15 entered. (The user would typically only see a label such as 
"First Name.") This schema does not, however, address the 
aforementioned problems. As one example, it is difficult to 
force millions of web sites to conform to standard field 
identifiers or to retrofit existing web pages to the existing 

20 schema. 

The prior art provides tools to stiggest previously used 
values to a computer user when prompting the user for 
information. For example, some e-mail programs suggest 
possible recipient names in the "to" field which match 

^ previously stored user names. When the user types the first 
character of a recipient's name, a possible choice that 
matches the first character appears in the field. As another 
example, well-known Internet browsers provide a user with 
a pull-down menu of choices in a browser's address field, 
such that the user can review previously used web site 
addresses in order to select an address. 

These conventional techniques, however, suffer from 
many of the same disadvantages as the aforementioned 
solutions. The application program itself (i.e., the e-mail 
program) must be specially modified to support the feature, 
and previously used field values cannot be shared among 
other application programs on the same computer unless 
those applications are also modified. Moreover, all applica- 
tion programs would need to adopt standard field identifiers 
in order for the scheme to work properly. 

For most web fonms there is no deterministic way to 
associate a given field label with its corresponding a text 
entry area (i.e., labels used on web forms are not linked to 

45 field identifiers on the page). For example, a web page that 
displays the word "Name" next to a text input box invites the 
user to enter his or her name into the text input box. 
However, there is no easy way for software reading the text 
entry box to associate the "Name" label with the text input 

50 box, since field labels are not intrinsically linked to other 
field attributes such as field name and data type. 
Consequently, while field labels might provide an attractive 
basis for correlating similar fields across forms, there is no 
easy way for the underlying software to identify and corre- 

55 late these labels with values entered by the user. 

In summary, Interact web pages containing form fields 
create special problems, because each web site defines the 
format and behavior of its own forms, and there is no easy 
way to share or suggest previously entered data values 

60 across different web sites or servers. Moreover, because of 
privacy concerns, sharing previously entered form values for 
different web sites may be undesirable or even impossible in 
many cases. 

SUMMARY OF THE INVENTION 

The present invention overcomes many of the foregoing 
problems by providing a method and apparatus that assists 
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a user in storing into a profile data values entered on a form tures. The system memory includes read only memory 

on the basis of labels associated with fields on the form. (ROM) 140 and random access memory (RAM) 150. 

When the user displays a form having the same or similar a basic input/output system 160 (BIOS), containing the 

field labels, a matching process suggests data values for basic routines that help to transfer information between 

fields on the form. According to one embodiment, the user 5 elements within the computer 100, such as during start-up, 

can initiate this process by clicking on an "autofill form" is stored in ROM 140. Computer 100 also includes a hard 

button. disk drive 170 for reading firom and writing to a hard disk 

A web browser employing various inventive principles (not shown), a magnetic disk drive 180 for reading from or 

hcuristically associates a label with a corresponding text writing to a removable magnetic disk 190, and an optical 

input box and, based on the association, populates the text disk drive 191 for reading from or writing to a removable 

input box with a previously stored field value. The match can optical disk 192 sudi as a CD ROM or other optical media. 

be performed on the basis of a dictionary of strings repre- The hard disk drive 170, magnetic disk drive 180, and 

senting common labels in order to retrieve previously used optical disk drive 191 are connected to the system bus 130 

values. An algorithm uses a hierarchical searching method to by a hard disk drive interface 192, a magnetic disk drive 

intelligently choose likely label candidates for a given field ^5 interface 193, and an optical disk drive interface 194, 

on a web page and confirms that choice by matching it respectively. The drives and their associated computer- 

against a previously stored dictionary of potential field readable media provide nonvolatile storage of computer 

labels. Other features include the ability to match fields with readable instructions, data structures, program modules and 

a multi-lingual dictionary; color coding of automatically other data for the personal computer 100. It will be appre- 

populatcd field values for ease of use; and an initial profile cialed by those skilled in the art that other types of computer 

creation step that matches values extracted from a populated readable media which can store data that is accessible by a 

form to a basic set of field labels. computer, such as magnetic cassettes, flash memory cards, 

digital video disks, Bernoulli cartridges, random access 

BRIEF DESCRIPTION OF THE DRAWINGS memories (RAMs), read only memories (ROMs), and the 

„^ ^ , . , , ^ like, may also be used in the exemplary operating environ- 

FIG. 1 shows a conventional general-purpose computmg ^^^^ 

environment that can be employed in various embodiments ' ^ . . ^ ^ 

f ih v t nn A number of program modules can be stored on the hard 

"^^^IT? ^ u u J 1 ■ disk, magnetic disk 190, optical disk 192, ROM 140 or 

no. 2 shows a distributed web-based system employing ^ including an operating system 195, one or more 


application programs 196, other program modules 197, and 


various principles of the invention. 30 

FIG. 3 shows a form comprising field labels and corre- program data 198. Any of the inventTve principles described 

sponding data entry areas rendered using a document herein can be implemented in software and stored on any of 

browser such as a web browser. t^e aforementioned storage devices. 

FIG. 4 shows the form ofFIG. 3 after it has been partially ^ ^.^n enter commands and information into the 

filled in by a user. computer 100 through input devices such as a keyboard 101 

FIG. 5 shows an "autofilF* dialog box that prompts the and pointing device 102. Other input devices (not shown) 

user to fill autofill preferences to be used on subsequently may include a microphone, joystick, game pad, satellite 

displayed forms. dish, scanner, or the like. These and other input devices are 

HG. 6 shows an autofill preferences form that permits the often connected to the processing unit 110 through a serial 

user to provide additional information to be used on subse- port interface 106 that is coupled to the system bus, but may 

quently displayed forms. be connected by other interfaces, such as a parallel port, 

no. 7 shows the form of FIG. 3 with certain fields game port or a universal serial bus (USB). A monitor 107 or 

highlighted to indicate that they contain automaticaUy sug- other type of display device is also connected to the system 

ested values 45 interface, such as a video adapter 108. In 

^^If^ « . r .t. ^ u u- i_ addition to the monitor, personal computers typically 

FIG. 8 shows a different form with certain fields high- .^^j^^^ ^^^^^ ^ j ^^^.^^ ^^^^ ^^^^^^ ^^^^ 

hghted to mdicate that they contain automaUcally suggested speakers and printers, 
values 

^ iLii .J. The computer 100 can operate in a networked environ- 

F^G 9Ashows two form fields including labels located to ^^^^ ^ing logical connections to one or more remote 

the left of the correspondmg data entry areas. computers, such as a remote computer 109. Remote com- 

HG. 9B shows two form fields including labels located p^^.^ 109 can be a server, a router, a network PC, a peer 

above the corresponding data entry areas. device or other common network node, and typically 

FIG. 10 shows a scries of steps that can be carried out in includes many or all of the elements described above 

accordance with one embodiment of the invention. 55 relative to computer 100, although only a memory storage 

device 111 has been illustrated in FIG. 1. The logical 

DETAILED DESCRIPTION OF THE connections depicted in FIG. 1 include a local area network 

INVENTION (LAN) 112 and a wide area network (WAN) 113. Such 

no. lis a schematic diagram of a conventional general- networking environments arc commonplace in offices, 

purpose digital computing environment that can be used to 60 enterprise-wide computer networks, intranets and the Inter- 

implement various aspects of the invention. Computer 100 net. 

includes a processing unit HO, a system memory 120, and When used in a LAN networking environment, the com- 

a system bus 130 that couples system components including puter 100 is connected to the local network 112 through a 

the system memory to the processing unit HO. The system network interface or adapter 114. When used in a WAN 

bus 130 may be any of several types of bus structures 65 networking environment, the personal computer 100 typi- 

including a memory bus or memory controller, a peripheral cally includes a modem 115 or other means for establishing 

bus, and a local bus using any of a variety of bus architec- a communications over the wide area network 113, such as 
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the Internet. The modem 115, which may be internal or A field label corresponds to what the user will actually see 

external, is connected to the system bus 130 via the serial on the display, while the field identifier or field name is 

port interface 106. In a networked environment, program frequently not displayed but permits software to identify the 

modules depicted reladve to the personal computer 100, or field. For example, a field label for a user's name might be 
portions thereof, may be stored in the remote memory 5 displayed as "NAME," while the field identifier, visible to 

storace device underlying software, might be "user_last_name." 

. , . . J . As explained in more detail below, field labels are typically 

It wiU be appreciated that the nctworic connections shown ^^^^^ ^ ^ ^^^^^^^^ ^ g^j^ ^^^^^ ^^g^^^ 

are exemplary and other means of estabhshmg a commum- separately and can be placed in any number of locations near 

cations link between the computers can be used. The exist- ^^^^ ^^^^ 

ence of any of various weU-known^^^^^^ ^^^l!'' '° As shown in FIG. 2, forms 250 and 260 require overlap- 

Ethernet, FTP, HTTP and the like is presumed, and the information; namely, the name and address of the 

system can be operated m a cbenNserver configuration to ^ ^^^^ ^^^^^^^ ^^^^ 

permit a user to retrieve web pages from a web-based server. ^^^^^ ^^^^.^ ^.^ 

Any of vanous convenUonal web browsers can be used to ^^^^^^^ ^^^^^^ ^j^^ ^-^^ ^^^^ ^^.^^ ^^^^ 

display and manipulate data on web pages. 15 ^^^^^^^^^ ^^^^ ^.^^^ ^^^j^^ «^^p^„ ^^^^^^^ 

HG. 2 shows a web-based distnbuted system employing browser 205 to perform various functions on client computer 

various principles of the present invention. As shown in FIG. 204 in connection with the processing of forms. 

2, a client computer 204 is coupled to a first web server 201 According to some embodiments of the invention, data 

and a second web server 202 through a network such as the ^^^^^^ ^^^^^^^ extracted and used to create 

Internet. Client computer 204 includes a conventional web ^ "profile" for the user, such that subsequently displayed 

browser 205 that has been modified in accordance with ^^^^ containing simQar field labels can be automatically 

various principles of the present invention. As is populated at the touch of a button. For example, the first time 

conventional, the user of client computer 204 can retrieve ^^^^ ^^^^ ^j-^j^j computer 204 visits web site 201 and 

web pages from web servers through the Internet using ^^^^^^ ^ ^^^^^ ^dd^ess, and telephone number into form 

HTTP protocols, and can display web pages usmg HTML 250, modified web browser 205 associates the values 

syntax. Application programs that operate withm the context entered by the user with field labels appearing near the 

of modified web browser 205 will automaUcaUy obtam the ^^^^^^ ^^^^^ ^^j^^^ -^^^ ^ ^^^^ structure 206 for 

benefits of the inventive principles described herem without ^^^^^ Consequently, when the user visits different web 

modification. Alternatively, any applicaUon program can be 202 and displays different form 260, web browser 205 

created or modified to operate according to the mventive ^^^^^^ ^^^^ ^f the field labels appearing on form 

principles. 260 to previously stored values (using, for example, a 

As shown in FIG. 2, each web server has an associated "dictionary" 207). These features preferably operate inde- 

Universal Resource Locator (URL) that uniquely identifies pendenlly of any logic at each web site, so that the automatic 

the server to client computers on the Internet. For illustration population of data values occurs without rcprogramming 

purposes, first web server 201 has a URL of www.onc.com existing web sites, and without requiring the storage of 

and second web server 202 has a URL of www.two.com. previously used data values at each web site. 

Web servers are sometimes referred to as "web sites," and Qne technique for obtaining an initial set of data values to 

those terms are used interchangeably herein. Each web site ^e used for populating subsequent forms is to prompt the 

may contain one or more web pages that can be linked and ^ggj. yjg^ ^^d complete an "autofill profile" the first time 

retrieved using conventional protocols, Web pages can also ^ ^ggd. For example, when the user first displays a 

be stored locally on client computer 204. form including certain common field labels (e.g., name, 

Web pages can include forms that permit a user to enter address, and the like), a profile generator function 205c 

data into predefined areas (e.g., rectangular boxes, radio extracts values entered by the user, uses the values to 
buttons, and the like). The forms are defined using HTML 45 populate a standard "autofill profile" form 203, and prompts 

syntax and stored in files as web pages. Document browsers the user to fill in any missing values. As shown in FIG. 2, for 

display forms by interpreting or "rendering" the HTML and example, when the user first visits web site 201 and displays 

generating display commands that permit a user to navigate form 250, after entering his or her name, address, and 

among the predefined areas and enter information. After a telephone number, modified web browser 205 determines 
user has entered information into the appropriate data entry 50 that no autofill profile has been created for the user. When 

areas, the user typically clicks on a "submit" button that the user attempts to submit the completed form, profile 

causes the browser to extract data values from the form and generator function 205c extracts the name, address, and 

transmit them to the server from which the form originated. phone number entered by the user, fills out the correspond- 

A web page can include multiple forms of various types and ing fields in autofill profile 203 by matching field labels in 
formats. 55 form 250 with those in autofill profile 203, and prompts the 

As shown in FIG. 2, web site 201 includes a first form 250 user to fill in missing data items such as e-mail. When the 

and second web site 202 includes a second form 260. Each user has completed the user profile, the completed form is 

form includes a plurality of data entry fields having an saved and used as the basis for populating future fonms. 

associated field label (e.g., name, address, phone), a field Consequently, when the user visits a site that requests an 
identifier (usually hidden from view), and a corresponding 60 e-mail address, modified web browser 205 uses the value 

display region into which the user can type information previously provided in the autofill profile, 

when the web page is displayed using a conventional web The aforementioned feature is similar to the prior art 

browser. For example, an on-line shopping service may "profile assistant" feature, but differs in two important 

require that a customer enter his name, address and tele- respects. First, the autofill profile can be populated auto- 
phone number to process an order. Similarly, a government 65 matically for the user based on a form the user has already 

agency may require that a citizen enter his or her name, used, rather than requiring that the user enter values in a 

address, and date of birth. blank autofill form. Second, the basis for detecting matches 
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between previously xised field values is, according to the requested by the form. As is conventional, each field input 

present invention, field labels that are displayed near each value appears in a text box on the display, and when the form 

text input box, rather than field identifiers that may vary is completed, the user clicks on a "submit order"' button (not 

widely from form to form. Thus, for example, the principles shown in HG. 4) to submit the completed form to the web 
of the present invention would detect matches between text 5 site that will process the order. In accordance with various 

boxes on different forms where each text box was located embodiments of the present invention, if no "aulofiU profile" 

near a label comprising the string "name." Other approaches ^as yet been defined by the t^er, the user is invited to 

that rely on matching field identifiers, an intrinsic attribute ^^'^^]f^%''' f^'^^ " P/°^^" '^'^A'^fK'''' '"^''""k 

of a form element, would fail if one form identified the field ^°™) ^ 

as "user^namc" and another identified a similar field as lO ^^X^^ . ^ ... . . 

"unam." In other words, the present inventors believe that ^ ^t^^^s * ^^^^8 ^^^^ ^ presented to the u^r 

web forms are more likely to have similar field labels (i.e., ^e chcks on a "submit foira" button where no autofill 

what the user sees) than field identifiers (i.e., what the P">fil<^ ^ t^f "s^r js invited to 

nroerammer sees) ^ j^* completed as the basis for 
T , . r^l. ^ * 1 . r i< completing an autofill profile. Upon clicking "yes" to this 

As shown in FIG. 2 one po^ible .mplementaUon for the 15 ^ ^^^^^ ^.^ ^^^^^^ 

mventive pnnciples ,nc udes a field matching fimctton 2QSa ^^^^ ^ ^ ,j ^^u.^^jj^n, i,,,, jjj^ particular 

that match^ field labels to those previously stored m data techniques (e.g., dialog boxes and the like) 

structute 206. According to jxrtain embodiments, a dictio- ^^^^ ^^^^^ ^^^^j^ exemplary only, and are by 

nary 207 is used to match field labels extracted from a form ^^^^ .^^^^^^^ j^^j, ^^^^ [j,^ [ 

to fields for which a previously stored profile value ,s 20 i„ inventive principles. 

avaUable As shown m FIG. 2 for example if the text stnng ' ^ ^ ^ 

"nombre" is extracted from a form near a text mput box, it . en Z r u ,u , u ■ ci/- « l 

can be matched against a dictionary ofcandidate field labels »° autoUl profile form such as that shown m FIG. 6 ^ 

in dictionary 207 and correlated to the field "name," for displayed to the user. As can be seen m FIG. 6 the autofill 
which a previously stored value exists in profile data storage 25 profile comprises a «t of commonly used fields on foms, 

-^a/ o *i .1, 1 ^ "Tnc» ^^r. Tr. such as the user s first name, middle mitial, last name, 

area 206. Consequently, the value JOE can be used to . . . j .-i i j 

1 / *u /u 1 . A f otri^o telephone numbers, and the like. In accordance with one 

populate the text box located on the form near the string . r.. ' • j . , * .u c u *u . 

f; ^ ^ aspect of the present invention, data values for the fields that 

were filled in by the user in FIG. 4 are extracted, matched 

A heuristics function 205fc can also be provided to accom- ^^^^^ ^^^^^^ ^lofiXe, and presented to the user 

modate differences m field labels (e.g., language differences ^^^^^ g ^^j^^^ ^^^^^^ p^^gj^ ^ partially 

for the same label) and to determine which of a plurality of created by an automatic matching process to save the user 

potential matches is likely the "best" match based on a inconvenience of manually entering fields that were 

various criteria such as field label placement and the like. As previously filled out in FIG. 4. AdditionaUy, the user has the 

described in more detail herein, for example, there may be ^p^.^^^ entering information for other fields for which no 

several text strmgs located near a text mput box, and one of ^^^^^^j^ ^^^^^ ^ ^^^^ Pj^ f^^. example, no 

the strings may be selected as the more likely field label ^^^^^ ^^^^^^ organization is yet known based on the 

based on its location and/or content. Other unplementaUons ^^^.^ -^p^^ q^^^^j. commonly used fields that could be 

are of course possible, and FIG. 2 is intended to lUustrate included on the autofill profile include fax numbers, 

only one of many such possibilities. spouse's names, mother's maiden name, credit card 

According to one embodiment, previously used data numbers, passwords, driver's license information, and the 

values are not used to populate a displayed form until the like. Once the user has entered any additional field values 

user affirmatively selects that choice. This avoids the pos- that he or she wishes, the user clicks on "OK" and the values 

sibility that a nefarious web site operator could extract appearing in the autofill profile are stored in a data storage 

previously used values that were used to populate a form area such as area 206 of FIG. 2. 

before a user had the opportunity to prevent the values from Assuming that the user had turned off the computer, 

being used. For additional security purposes, the values in returned a few days later, and again visited the same web site 

data store 206 could be stored in encrypted form in a pj^^ ^ jj^^ ordtT, according to the inventive principles 

protected area in client computer 204. As explained in more ^^iq user would be able to automatically populate the form 

detail herein, the values can be correlated or combined with ^^^ij^g "autofiir option as shown in FIG. 7. According to 

data from other sources, such as values used by the profile embodiment, a special "autofiU" option 701 is provided 

assistant, an entry for the user in an operating system address t of document browser 205 which, when selected by 

book, commonly used field identifiers, or field names from ^^^^ automatically populates the form for all fields that 

the vCard schema. can be matched to the previously stored user profile. In one 

FIG. 3 shows a form, as rendered in a document browser 55 embodiment, all field values that are automatically popu- 

window, which permits a user to place an on-line shopping lated in such a manner are highlighted using a different color 

order. As is conventional, the form comprises a plurality of (e.g., yellow) to indicate to the user that those fields were 

fields, each having a defined text box area as generally supplied from previously used values. This gives the user the 

depicted by reference numeral 302. A plurality of text labels opportunity to verify any fields that may need to be changed, 

generally indicated by reference number 301 is located in and permits correction of any fields that were incorrectly 

close proximity to the text boxes, such that each label matched. As shown in FIG. 7, for example, all fields that 

appears to be associated with a text box. As will be described were automatically populated using option 701 are shown in 

in more detail herein, the text labels do not actually form part a highlighted (shaded) color. Processing of the form would 

of a field, but can instead by defined by a web form designer occur normally; i.e., the user would click on "submit form" 
to appear near a corresponding text box. 55 to process the order. 

FIG. 4 shows the form of FIG. 3 after a user has entered Assuming the user visited a different web site having a 

his first name, last name, and various other data items different fonn, the user can click on the "autofill" option and 
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have all fields for which a match is found automatically ing on the design of the web page, the label associated with 

populated according to the inventive principles. Turning to a given data input box can be located in different regions on 

FIG. 8, for example, a different form is shown from that of the page; labels are not delerministically located based on 

FIG. 4. However, the form comprises some fields having the location of the text-input box. 

labels that match those previously stored in the profile. Upon 5 According to one embodiment of the present invention, an 

selecting "aulofill" option 803, those fields for which match- HTML-based page is scanned to identify the label most 

ing labels could be found are automatically populated with likely associated with a given data input box, and that label 

previously stored values from the autofiU profile, and are is used to match previotisly stored data values for a given 

highlighted (e.g., shaded or colored) to indicate those fields field. This approach differs from other approaches that rely 

that were automatically populated. As shown in FIG. 8, two lO on matching a field name attribute (e.g., "uscmamc" in the 

fields that were not automatically populated (and thus above example), which may vary significantly from web site 

requires new entry from the user) are the user's middle to web site. In other words, different web site designers may 

initial 801 and the user's favorite color 802. use widely varying attribute names to reference a person's 

According to one variation of the invention, upon detect- name (e.g., "uname", "usemame", "nm", etc.), but are likely 

ing that the user had entered a middle initial in text box 801 15 to use commonly known labels for the user to see (e.g., 

of FIG. 8, profile generator 205c again displays autofill "Your Name" or "Name'% Consequendy, according to one 

profile 203 (see FIG. 6), including the newly entered middle aspect of the present invention, labels associated with text 

initial, and ask the user to confirm that this newly entered entry fields are used as a basis for identifying previously 

value is to be stored and used in the future. Other embodi- populated fields. The principles of the invention can be 

ments and variations on this technique are of course pos- ^ applied not only to text input boxes, but also to checkboxes, 

sible. Nor is such a feature a necessary part of the invenUon. multiple choice elements, radio buttons, action buttons, 

. , . • f * u- • \ A <i^^A multi-line text areas, and other form constructs. 

Various techniques for matching previously used field _ .i *u • *• u u *u j ji ■* 

1 c ij 1 LI • j'ff 4 f ™^ Consequently, the invention should not be deemed limited to 

values with field labels appeanng on dilierent rorms will ^^^^ in ut boxes 

nowbeexplained,beginning with reference to FIGS. 9A and _ ^ .„ , * ... c u i l i . .i. 
no jj^AT u c •^ r^r..^ t-w irrr,..* ^5 OiiQ difBculty With matching field labels to those previ- 

9B. HTML web forms can include one or more text input , , . . c u i ifi • u^mt • . • n 

. .• 1 ui L w u- 1 L - u iu 11 ously used is that a field label in HTML is typically a 

boxes, cbckable buttons, multiple -choice checkboxes, pull- * . * *u * - * -u . ^ ei^/ « i^ 

, 1- 1 ui — * • 11 separate text Item that IS not an attribute of a field (i.e., a field 

downmenus, or clickable images, typically placed within an . _ , c • *u 

TTTT^xi e , m fin ♦ a " lat^^l cannot be referenced or detected by referencing the 

HTML <form> tag. Once a user fills out fields in a form, the ^ u i_ \ . u • r "Ij * a ■ l ui 

. „ ,( . u / ,u field by its name). One technique for identifying labels 

user typically clicks on a "submit button (or presses the ^j ^ucu-* u u e 

V \ / i_ r , *u L •* tC u 30 associated with fields is to search each HImL page tor 

return key) to submit the form to the web site. The browser ui i u i *u . i . ^ « i^ ^ . ; 

V V J 1 -1 J «u » *u possible labels that are located near the field data entry areas, 

collects the user-supplied values and sends them to the *1 . , , . . , u - i _j c i * * 

c e for roccssin ^ hierarchical order For example, text 

server or processing. appearing immediately to the left of a text input box would 

One particular type of form element is a text entry field, ^ g^^j candidate for the label. If no text appears 
which appears m a browser window as an empty box and 35 immediately to the left of a text input box, then text 

accepts a line of user input that becomes the value of the appearing directly above the text input box might be the next 

element when the user submits the form to the server. To ^^^^ candidate, and so on. Field attributes can also be used 

create a text entry field in a form, an <mput> form element ^^^^ determine what label is associated with a given field, 

is set to type "text" and assigned a name (e.g., ' username ). ^ potential label is found, it is compared to a dictio- 

Each field is usually accompanied by a label located near the of strings that represent common labels for autofill 

textbox.Forexample,thefollowingHTMLfragmentswhen p^^^j^ ^^^^ ^ ^^^^^ ^^^^^^ ^1^^ -^^^^ g^l^ 

rendered by a browser display two text entry fields with populated with the matched profile data and preferably 

accompanying labels "Your Name" and "Your Age" located highlighted with a different color to caution the user to 

to the left of the corresponding data entry area using a table double-check these values before submitting the form, 

construct. The corresponding form display would be similar pj^. ^^^^^ ^^^^^ ^^^^ executed to carry out a 

to that shown in FIG. 9A. method of the present invention. It is assumed that the user 

<forra method-POST action=."http://wwwjunk.com/ ^as clicked on an "autofill" button to automatically populate 

^ a displayed form. Beginning in step 1001, the next input 

<table border=0> field on the form is identified by, e.g., scanning the HTML 
<tr valign=top> 50 file until an input field is found. In step 1002, potential labels 

<td align=right>Your Name:</td> for that input field are identified, preferably using one or 

<td align=left><input type=text name=usemame more techniques discussed below, although these techniques 

size-32 maxlength-80> are by no means the only possible ones. 

</td> Id step 1003, the candidate label or labels are compared 
<tr valign-top> 55 to a dictionary of strings constituting commonly used labels 

<td align=righl>Your Age:</td> (e.g., name, address, etc.). The dictionary can be quite robust 

<td align=left>input type=text narae«agc size=3 and can include, for example, mappings among different 

maxlength=3> terms. For example, the dictionary can include multi -lingual 

</td> entries, such as "name", "nombre" (Spanish for "name"), 
</tr> 60 and "mci" (Japanese for "name"). Note that this can also 

</table> provide the ability to match fields on forms containing 

</form> mixed languages on the same form. Abbreviated versions of 

While often located to the left of the data input box, labels commonly used labels can also be included in the dictionary 

are sometimes located above, to the right of, below, or in (for example, "address", "addr", and "add"). Moreover, the 
other areas close to the data input area. For example, FIG. 65 dictionary can map vCard values to labels. The dictionary 

9B shows a different variation of the form in which the entries can also be arranged in a particular order to ensure 

labels appear above the pertinent data entry boxes. Depend- that matches near the top of the list are preferred over other 
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matches appearing later in the list. Alternatively, a "best fit" 
approach can be used by considering other field attributes 
and the degree of match between candidates and dictionary 
strings. In addition to vCard mapping, field labels can be 
matched against the emerging Electronic Commerce Mod- 
eling Language (ECML) standard, which is documented at 
www.ecml.org. 

In step 1004, if no match was found, the field is not 
populated and processing resumes at step 1001 imtil no more 
fields are found. If a match is found, then in step 1005 a 
previously stored data value is retrieved from the user 
profile, and in step 1006 the input field is populated with the 
located data value. Additionally, the input field is preferably 
changed to a dififerent color or shading to indicate to the user 
that the field has been automatically populated. 

As described above, certain heuristics can be used to 
increase the likelihood of a successful match in fields. In one 
embodiment, modified web browser 205 scans adjacent 
HTML in a document to identify field labels that are likely 20 
associated with a given input field. The scanning can be done 
by first attempting to identify a field from a table cell to the 
left of the current input element; then attempting a match 
from a table cell above the current element; then attempting 
a match from a table cell to the right of the current element; 25 
then attempting to match from text to the left of the input 
element, and so forth. In effect, the scanning can search 
surrounding text to identify the most likely label for the 
field. Although there are many different methods of search- 
ing for labels on a web page, the following pseudocode 30 
mock-up illustrates one possible approach for this type of 
search: 


12 


-continued 


function tableUfl( ) { 
// Get the ftmction argumcQL 
curElcmcnt - aigumcnts tOj; 
// Check the table cell to the left, 
if (cuiElement.parentElemcDt.tagName - 
curCcli - curElemcni,parentE]cment; 
if (cuiCcll-pareatElcmcnttagName — "TR" && 
cuiCtll.ccllInd€X > 0) { 
cuiRow B cuiCellpareatElcment; 
leftCcll ° curRow.cclIs[ciirCcll. cell Index - 1 J 
testString - IcftCclLinneiTcxt; 
Rcld » paticrnMatch(testString); 
return fteld; 


} 
} 

// Nothing matched, 
return false; 

} 

function tableUp( ) { 
// Get the function argument. 
curEtement = arguments [0]; 
// Chect the table cell above, 
if (curElcmentparentElementtagName — "TD*') { 
curCell - curElementparcntElement; 
if (curCe]l.parentElement.tagName — "TR") { 
cuiRow - curCell.parentElement; 
if (curRow.parentElement.tagNaine — "TBODV && 
curRow.rowIndex > 0) { 
curTBody - curRow.parentEle[nent; 
aboveRow - cuiTBody.rows[ curRow.ro windex - 1]; 
if (aboveRow.cclls. length > cutCcll.celllndcx) { 
aboveCcll - abovcRow.cellfi(curCell.ccll[ndex]; 
testString - aboveCell.inncrText; 
field B pa ttemMatcb (testString); 
return field; 

} 


} 


} 


<sciipt language="JavaScript**> 
function populate( ) { 
// Extract the fonns and cycle through them. 
docForms = iop.frames["inain"] .documcnl-forms; 
for (i=0; i < docForms .length; i++) { 
prev = " "; 
// Loop through the elements in the form. 
curForm = docForms [ij; 
for (j=0; j < curForm.eiements. length; { 
// Set the current element 
cur Element - curFonn.elements[j]; 
if (curElcmenLtype !- "text") continue; 
formFicld = false; 

// Try to identify field from left table cell. 
formField = tableLcft(curElement); 
if (formField) {set(cur Element, formField); continue;} 
// TYy to identify field from above table cell. 
formField - tableUp(curElement); 
if (formField) {set(curHement, formField); continue;} 
// Try to identify field from right table cell. 
formField - tableRight(curElement); 
if (formField) {8et(cur Element, formField); continue;} 
// Try to identify field from text to the left. 
formField - textLeft(cur Element); 
if (formField) {8et(curElement, formField); continue;} 
// Try to identify field from current cell. 
formRcld = inputNamc(curElcmcnt); 
if (formFicld) 5et(curElcment, formField); 


35 


// Nothing matched, 
return false; 


} 


^ } 

function inputName( ) { 
// Get the function argumenU 
curElement - arguments (01; 
// Check the name value for the current clement. 
ustStfing - cuiElcmcnLname; 
field « patternMaich(tefitString); 
return field; 

} 


40 


45 
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} 

function tableRight( ) { 
// Get the function argument 
curElement = arpiments [0]; 
// Check the table cell to the right, 
if (curElement. parentElementtagName = ^TD") { 
curCell = curElementparentEIement; 
if (curCell.parentElement. tagName *TR" && 

curCell. celllndoc < curCell. pa rentElement-cel Is. length - 1) { 
curRow » curCell.parentElement; 
rightCell - curRow. eel UlcurCtll. cell Index + IJ 
testString - rightCell.innerlfext; 
field » patteTnMatch(testString); 
return field; 

} 

} 

// Nothing matched, 
return false; 


} 

function textLeft( ) { 

// Get the function argument 

curElement ° arguments (01; 

// Check the text to the left. 
55 tr = top. frames! "main"! document body.crcateTextRange( ): 

tr.move'IbEIemcnlText(curElement); 

tr.collapse( ); 

tr.moveStart(**word", -3); 

tcatString - tr.tcxt; 

field « pattemMatch(tcstString); 
60 return field; 

) 

function set( ) { 
// Get the function arguments. 
curElement - arguments[0]; 
field - argumenta(i]; 
65 // Oct the value to use in setting the field, 
value - gctValue(ficld>, 
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-continued 


// Set the field with the value. 
curElc meat, value » value; 
//All done, 
return; 

funcUoD patlemMatch( ) { 
// Oct the function argument, 
testString » arguments (OJ 
// Look for a match. 

if (coatains(testStxing, "email")) return "email"; 
if (oontains(testString» "-mail")) return "email"; 
if (oontains(tcslString, "strccl'O && prev != "strcctAddr^) return 
"streetAddr"; 

if (contains(teslStiiQg, "address") && prev !- "slreclAddr") return 
"streetAddr"; 

if (oontains(teslString, "city")) return "dty"; 
if (oontains(teslString, "state'O) return "statcAbr"; 
if (oontains(testStiing, "province")) return "stateAbr"; 
if (oontains(teslString, "2ip'*)) return "zipCode"; 
if (oontains(tcslString, "posf^ && contains(tcstString, "code")) 
return "zipO>de"; 
if (contains(tcslString, "country")) return "country"; 
if (containsjtestString, "phone")) { 
if (contains(te8tString, "wort')) 

return "workPhone"; 
else return "homePhone"; 

} 

if (contains(testStTing, "company**)) return "company"; 
if (contains(testStTing, "organization")) return "company"; 
if (oontains(testString, "name")) { 

if (contains(tcstString, "user")) return false; 

if (contains(tcstString, "last, first")) return "fullBackward"; 

else if (contains (testString, "first")) return "firstNamc"; 

else if (contains (testString, "last")) return "lastNamc"; 

else return "full Forward"; 

} 

if (contains(tcstString, "fiisl") && prev != "firstName*') return 
"firstName"; 

if (co[itains(testStriQg, "last") && prev != "lastName") return 
"lastName"; 

if (co[itains(teslStiiDg, "middle")) return "middle"; 
if {contains(te5tString, "mi")) return "middle"; 
if (contains(teslString, "m.i.")) return "middle"; 
// No match was found, 
return false 

} 

function contains ( ) { 
// Get the function arguments, 
large » arguments[0].toIjoweiCase( ); 
small » arguments[l].toLowerCase( ); 
// See if the first argument string contains the second, 
for (m»0; m <= large.length - small .length; m++) { 
test - true; 

for (nsO; n < small. length; n++) { 

if (large. char A t(m+n) I«» smaIl.charAt(n)) { 
test = false; 
break; 


} 
} 

if (test) retum true; 

} 

// No match was found, 
return false; 

function getVfelue( ) { 
// Get the function argument 
field » argumontslO]; 
// Return the corresponding value, 
match " " "; 

if (field "firetName") match = " get from user profile ** 
else if (field — "lastName") match - * * get from user profile * ' 
else if (field -» "middle") match » ** get from user profile *• 
else if (field "fullForward") match » ** get from user profile 
else if (field -= "fullBackward") match - *• get from user profile • 
else if (field « "email") match - •* get from user profile 
else if (field "streetAddr") match - '* get from user profile 
else if (field "city") match - ** get from user profile ** 
else if (field — "staicName") match » get from user profile *• 
else if (field -= "stalcAbr") match = ** get from tiser profile •* 


else if (field «»= **zipCod6") match - " get from user profile 
else if (field "country") match - ** get from user proiUe 
5 else if (field -» "homcPhone*') match «• • * get from user profile ' ' 
else if (field — "workPhone") match = • • get &om user profile ' ' 
else if (field «=- "cornpany") match » ** get from user profile 
if (match != " ") prev - field; 
retum match; 


} 

10 </script> 

<button onclick""populatc( )">populate-<;/button> 


In the code fragments above, once a label is matched to a 
field, then the value for that field is extracted from the user 

15 profile and used to populate the input area. 

In addition to matching by labels as illustrated above, 
matches on fields can be made where a field includes a 
vCard attribute (e.g., vCard .FirstName). Consequently, for 
fields having vCard attributes, the vCard attribute would 

20 override any labels located near the field, and searching 
could be stopped without going further. 

Additionally, matches could be made based on field name 
attributes. Thus, for example, a field having a name attribute 
containing "fax" could be matched to a fax telephone 
number without regard to labels located near the text input 
box. Or, a field name could be combined with a label search 
to provide a high likelihood of success on a match (e.g., if 
a field contains the string "phone" and there is a label nearby 
containing the string "work", an inference might be drawn 
that the field is a work telephone number). 

It may be desirable to match labels according to a pre- 
determined order to facilitate matching. In this regard, 
matching may be attempted in the order of the following 
fields (i.e., for each label, a match will be attempted for 
variations of each field below in the order listed): 

35 (1) e-mail address 

(2) street address 

(3) city 

(4) state 

(5) zip code 
40 (6) fiill name 

(7) country 

(8) phone 

(9) work 

(10) company 
45 (11) name 

(12) user 

(13) first name 

(14) last name 

(15) last/first name 
50 (16) middle initial 

(17) fax number 
Thus has been described a method and apparatus for 

automatically populating a form comprising a plurality of 

fields. The foregoing explanation includes many variations 
55 and embodiments, and the invention is not intended to be 

limited to the specific details disclosed herein. 

Consequently, the invention is limited only by the claims 

appended hereto. 
What is claimed is: 
60 1. A computer-implemented method of entering 

information, comprising the steps of: 

(1) displaying on a computer screen a form comprising a 
first plurality of fields each including an initially empty 
data entry region into which a data value can be entered 

65 by a user; 

(2) manually entering data values into one or more of the 
initially empty data entry regions; 
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(3) indicating that the form should be submitted for 
processing; 

(4) in response to step (3), prompting the user to complete 
a profile comprising a second plurality of fields each 
including a data entry region; 5 

(5) in response to the user choosing to complete the profile 
of step (4), displaying the profile on the computer 
screen, wherein at least some of the second plurality of 
fields arc populated with data values copied from data 
values manually entered in step (2); jq 

(6) manually entering data values into one or more of the 
data entry regions of the second plurality of fields; and 

(7) storing the profile in a computer storage device for 
future use. 

2. The computer-implemented method of claim 1, 
wherein step (5) comprises the step of matching text labels 
appearing on the form with a dictionary of predetermined 
field labels and, in response to detecting a match, establish- 
ing a correspondence between one of the first plurahty of 
fields and one of the second plurality of fields and copying 
the data value entered in the data entry area of the first ^ 
corresponding field into the data entry area of the second 
corresponding field. 

3. The computer-implemented method of claim 2, 
wherein step (5) comprises the step of comparing text labels 
with predetermined text labels spanning a plurality of Ian- 25 
guages. 

4. The computer-implemented method of claim 2, 
wherein step (5) comprises the step of searching around each 
empty data region on the form in a hierarchical directional 
order to match the text labels. 

5. The computer-implemented method of claim 1, 
wherein step (1) comprises the step of using a web browser 
to render HTML statements on the computer screen, and 
wherein step (5) comprises the step of scanning the HTML 
statements to identify matching fields. 

6. The computer-implemented method of claim 1, further 
comprising the steps of: 

(8) displaying a second form comprising a third plurality 
of fields at least some of which differ from those 
appearing on the form and those appearing on the *° 
profile, wherein each of the third plurality of fields has 
associated therewith a data entry region; 

(9) in response to the user indicating that the second form 
should be automatically populated, retrieving one or 
more data values from the profile and using the one or 
more retrieved data values to populate at least some of 
the data entry regions associated with the third plurality 
of fields; and 

(10) highlighting the data entry regions populated in step 
(9). 

7. The computer-implemented method of claim 6, 
wherein step (9) comprises the step of matching text labels 
appearing on the second form with prestored text su-ings to 
identify matching data values in the profile. 

8. A computer readable medium storing computer execut- 
able instructions that perform the steps of: 

(1) displaying on a computer screen a form comprising a 
first plurality of fields each including an initially empty 
data entry region into which a data value can be entered 
by a user; 
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(2) manually entering data values into one or more of the 
initially empty data entry regions; 

(3) indicating that the form should be submitted for 
processing; 

(4) in response to step (3), prompting the user to complete 
a profile comprising a second plurality of fields each 
including a data eaU^y region; 

(5) in response to the user choosing to complete the profile 
of step (4), displaying the profile on the computer 
screen, wherein at least some of the second plurality of 
fields are populated with data values copied fi'om data 
values manually entered in step (2); 

(6) manually entering data values into one or more of the 
data entry regions of the second plurality of fields; and 

(7) staring the profile in a computer storage device for 
future use. 

9. The computer-readable medium of claim 8, wherein 
step (5) comprises the step of matching text labels appearing 
on the form with a dictionary of predetermined field labels 
and, in response to detecting a match, establishing a corre- 
spondence between one of the first plurality of fields and one 
of the second plurality of fields and copying the data value 
entered in the data entry area of the first corresponding field 
into the data entry area of the second corresponding field. 

10. The computer-readable medium of claim 9, wherein 
step (5) comprises the step of searching around each empty 
data region on the form in a hierarchical directional order to 
match the text labels. 

11. The computer-readable medium of claim 9, wherein 
step (5) comprises the step of comparing text labels with 
predetermined text labels spanning a plurality of languages. 

12. The computer- read able medium of claim 8, wherein 
step (1) comprises the step of using a web browser to render 
HTML statements on computer screen, and wherein step (5) 
comprises the step of scanning the HTML statements to 
identify matching fields. 

13. The computer-readable medium of claim 8, wherein 
the computer executable insUiictions further perform the 
steps of: 

(8) displaying a second form comprising a third plurality 
of fields at least some of which differ from those 
appearing on the form and those appearing on the 
profile, wherein each of the third plurality of fields has 
associated therewith a data entry region; 

(9) in response to the user indicating that the second form 
should be automatically populated, retrieving one or 
more data values from the profile and using the one or 
more retrieved data values to populate at least some of 
the data entry regions associated with the third plurality 
of fields; and 

(10) highUghting the data entry regions populated in step 

14. The computer-readable medium of claim 13, wherein 
step (9) comprises the step of matching text labels appearing 
on the second form with prestored text strings to identify 
matching data values in the profile. 
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